Mining of health and disease events on Twitter: validating search protocols within the setting of Indonesia

Authors

  • Aditya L. Ramadona
  • Rendra Agusta
  • Sulistyawati
  • Lutfan Lazuardi
  • Anwar D. Cahyono
  • Åsa Holmner
  • Fatwa S.T. Dewi
  • Hari Kusnanto
  • Joacim Rocklöv
Abstract

This study seeks to validate a search protocol of ill-health-related terms using Twitter data, which can later be used to understand whether, and how, Twitter can reveal information on the current health situation. We extracted conversations related to health and disease postings on Twitter using a set of pre-defined keywords, assessed the prevalence, frequency, and timing of such content in these conversations, and validated how well this search protocol detected relevant disease tweets. The Classification and Regression Trees (CART) algorithm was used to train and test search protocols of disease and health hits, compared with those identified by our team. The predictions showed good validity, with an AUC above 0.8. Our study shows that monitoring of public sentiment on Twitter can be used as a real-time proxy for health events.
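
As a rough illustration of the validation idea in the abstract, the sketch below scores tweets by keyword hits and compares those scores with manual labels using AUC. It is a minimal sketch under stated assumptions, not the authors' pipeline: the keyword list, the example tweets, and the labels are all hypothetical, and a plain keyword count stands in for the CART classifier used in the study.

```python
# Hypothetical keyword list (Indonesian health terms); not the study's protocol.
KEYWORDS = {"demam", "batuk", "flu", "sakit"}


def keyword_score(text, keywords=KEYWORDS):
    """Count how many distinct keywords appear as whole tokens in the text."""
    tokens = set(text.lower().split())
    return len(tokens & keywords)


def auc(scores, labels):
    """Area under the ROC curve by pairwise comparison (ties count as 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical tweets with manual labels (1 = relevant disease tweet).
tweets = [
    ("anak saya demam dan batuk sejak kemarin", 1),
    ("lagi flu berat hari ini", 1),
    ("sakit hati ditinggal pacar", 0),
    ("macet parah di jalan sudirman", 0),
    ("kepala pusing sekali", 1),
]
scores = [keyword_score(text) for text, _ in tweets]
labels = [label for _, label in tweets]
print(f"AUC of keyword score vs. manual labels: {auc(scores, labels):.2f}")
```

The toy data deliberately include a keyword hit that is not a health report and a health report with no keyword hit, which is the kind of mismatch a validation step against manual labels is meant to expose.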

Related resources

Examination of Emergency Medicine Physicians’ and Residents’ Twitter Activities During the First Days of the COVID-19 Outbreak

Introduction: Social media has become an important element of interaction and found itself a place in every aspect of our lives. This study examined the Twitter activities of emergency medicine physicians and residents (EMP&R) about the COVID-19 outbreak. Methods: The study concentrated on Twitter, a major social media network. To identify accounts owned ...

Factors Involved in Missed Nursing Care: A Systematic Review

Background. Missed Nursing Care (MNC) is experienced in nearly all health care facilities. Awareness of the aspects involved in the occurrence of MNC can lead to the improvement of the quality of patient care. This systematic review aims to answer the question: "What factors are involved in the incidence of missed nursing care?" Methods. This systematic review follows the Preferred Reporting...

A High-Performance Model based on Ensembles for Twitter Sentiment Classification

Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people around the world intensively use social networks like Twitter. It lets users publish tweets to share what they think about topics. There are numerous web sites built on the Internet presenting Twitter. The user can enter a sentiment ta...

2016 Olympic Games on Twitter: Sentiment Analysis of Sports Fans Tweets using Big Data Framework

Big data analytics is one of the most important subjects in computer science. Today, due to the increasing expansion of Web technology, a large amount of data is available to researchers. Extracting information from these data is one of the requirements for many organizations and business centers. In recent years, the massive amount of Twitter's social networking data has become a platform for ...

Sociological Public Perception of the Coronavirus Disease 2019 Situation in Indonesia: A Phenomenological Study

Background: The rapid spread of the coronavirus disease 2019 (COVID-19) has caused many fatalities in Indonesia. This condition has triggered social changes that were never imagined before. There is no research that explores the public's perception of the sociological situation. Purpose: To sociologically explore community perceptions of the COVID-19 pandemic situation in Indonesia. Methods: P...

Publication date: 2016